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Abstract 

We construct a new quantum algorithm for the graph collision problem; that is, the problem of 
deciding whether the set of marked vertices contains a pair of adjacent vertices in a known graph G. 
The query complexity of our algorithm is 0(\/n + \Ja*(G)), where n is the number of vertices 
and a*(G) is the maximum total degree of the vertices in an independent set of G. Notably, if G is a 
random graph where every edge is present with a fixed probability independently of other edges, then 
our algorithm requires 0(y/n log n) queries on most graphs, which is optimal up to the \/log n factor 
on most graphs. 

1 Introduction 

The quantum query complexity of a function is a natural counterpart of its classical query complexity; 
namely, it is the number of quantum oracle calls that an algorithm has to make in order to compute 
its value. Graver's search algorithm [Gro96 1 gave the first example of a function whose quantum query 
complexity is significantly smaller than classical: computing the value of the OR function VILi x i requires 
O(n) classical queries, but only 0{y/n) quantum queries. 

It has been shown by Bennett et al. MBBBV971 that Graver's algorithm is optimal for computing the 
OR function. Later, Beals et al. ||BBC + Qlt proved that no total function can have the gap between its 
quantum and classical query complexities larger than polynomial. 

Another important problem where a quantum query algorithm can be much faster than a classical 
one is the element distinctness problem, where n elements are "colored" by an oracle and the goal is to 
decide whether there are at least two elements of the same color. In 2001 Buhrman et al. fBD H+051 
constructed an algorithm that required (3(n 3//4 ) quantum queries, later a lower bound of 0(n 2//3 ) was 
shown by Aaronson and Shi HAS041 . Finally, in 2003 Ambainis [Amb07] gave a new algorithm that had 
query complexity 0(n 2 / 3 ), thus matching the lower bound. 

The graph collision problem was first considered by Magniez et al. HMSS07L where it was shown to 
have quantum query complexity 0(n 2//3 ). The algorithm used in [MSS07] can be viewed as a natural 
adaptation of Ambainis' algorithm for the element distinctness problem. On the other hand, the lower 
bound techniques used in [AS041 don't seem to be applicable to the graph collision problem, and the 
actual quantum query complexity of it is still an open question. 

1.1 Our results and techniques 

We present a new quantum algorithm for the graph collision problem. The complexity of our algorithm 
depends on the properties of the given graph G. Throughout the paper, the quantum query complexity of 
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a decision problem refers to that with a constant two-sided error. 

Theorem 1. For a graph G on n vertices, the quantum query complexity of the graph collision problem 
on G is 0(y/n + a*(G)), where a*(G) is the maximum total degree of the vertices in an independent 
set of G. 

Notably, this implies that the graph collision problem requires only 0{y/n) quantum queries for most 
graphs in the following sense. Let // n)P be the distribution corresponding to choosing a graph on n vertices, 
where every edge is present with probability p independently of other edges. 

Corollary 2. For arbitrary function p: N — > [0, 1], let G ~ fJ- n ,p(n)- Then the (worst-case) quantum query 
complexity of the graph collision problem on G is almost alwaySi} 0(y/n log n). 

The above result is optimal up to the \/logn factor for most random graphs, as computing the OR 
of n variables can be reduced to solving the graph collision problem on any graph G that contains Q(n) 
non-isolated vertices. 

Our algorithm for Theorem Q] works as follows. As a preprocessing, we estimate the sum of the 
degrees of the vertices in S. If this sum is much larger than max{a*(G), n}, then we answer "5 is not an 
independent set" and halt. This requires 0(y/n) queries, due to the approximate counting algorithm by 
Brassard, H0yer, and Tapp [BHT98 ]. To handle the remaining (main) case, we construct a span program 
with witness size 0(y/n + \f a*(G)). It was shown by Reichardt [Rei09l lReillH that the quantum query 
complexity of a promise decision problem is at most a constant factor away from the witness size of a 
span program computing it. 

1.2 Related work 

Magniez, Santha, and Szegedy [MSS07| introduced the graph collision problem and gave a quantum 
algorithm with 0(n 2 / 3 ) queries. They used it as a subroutine used in their O(n 13 / 10 )-query algorithm 
for the triangle finding problem. This 0(n 2 / 3 ) is the best known upper bound on the quantum query 
complexity of the graph collision problem. The best known lower bound for the graph collision problem 
is £l(^Jn), which follows easily from the lower bound for the search problem [BBBV97]. Jeffery, Kothari, 
and Magniez [JKM12] recently gave a quantum algorithm for the graph collision problem on a bipartite 
graph which is useful when the given bipartite graph is close to the complete bipartite graph: the query 
complexity of their algorithm is 0{yfn + \/m), where m is the number of missing edges compared to the 
complete bipartite graph. 

Improving the query complexity of the graph collision problem has important consequences. First, 
improving it is likely to give a better algorithm for the triangle finding problem by applying the same 
technique as the one used in Ref. [MSS07]. Second, the graph collision problem is equivalent to the 
evaluation of a 2-DNF formula, and the techniques used in the graph collision problem may be also 
applicable to the more general fc-DNF evaluation. 

Our algorithm for the main case of the graph collision problem, including its use of span programs, is 
inspired by the recent result by Belovs and Lee HBL111 . 

'Cf. Theorem[9]foi' the corresponding quantitative statement. 
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2 Preliminaries 



We will consider the quantum query complexity of the following problem. 

Definition 1 (Graph collision problem). Let G = (V, E) be a graph. The graph collision problem on G 
asks, given oracle access to a string x G {0, 1} V , whether there exists an edge G E such that Xi = 



1. 



Note that graph G is given explicitly to the algorithm, and the only part of the input which needs to 
be queried is the string x G {0, 1} V ■ We call a vertex i marked if X{ = 1. Note that the graph collision 
problem is equivalent to deciding whether the marked vertices form an independent set in G, with the 
answers "yes" and "no" swapped. 

In the rest of the paper, we let V = [n], where [n] denotes the set {1, . . . , n}. For a graph G = {V,E) 
and a set S C V of vertices, we denote by deg(5) the sum of degrees of vertices in S. For any graph G, 
we denote by a* (G) the maximum total degree of the vertices in an independent set of G; that is, 

a*{G) = max{deg(S) : S is an independent set in G}. 

We will use the following form of Chernoff bound, as stated by Drukh and Mansour [DM05 ]. 

Lemma 3 (Chernoff bound). Let X\ , . . . , X n be mutually independent random variables taking values 
in [0, 1], such that E [Xi] = p, for all i G [n]. Then for any A > 1, 
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Xi > Xn/j, 

ie[n] 

All logarithms in this paper are natural 



n(A — l) 2 //\ 
< exp ( ; ] < exp(^(3 — Xjn/j,). 
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2.1 Span programs 

Span program is a linear-algebraic model of computation introduced by Karchmer and Wigderson [KW93 ] 
to study the computational power of counting in branching programs and space-bounded computa tion. In 
our context, the relevant complexity measure is its witness size introduced by Reichardt and Spalek [RS08|. 
We use a formulation closer to that used by Reichardt [Rei09 |. 

Definition 2 (Span program). A span program P = (H,\t); V±o, Vu, . . . , V n o, V n i) with n-bit input is 
defined by a finite-dimensional Hilbert space % over C, a vector \t) G H, and a finite set Vj\, C % for 
each j G [n] and each b G {0, 1}. This span program is said to compute a function /: D — > {0, 1}, 
where D C {0, l} n , when for x G D, we have f(x) = 1 if and only if \t) lies in the subspace of % 
spanned by UjeW ^ ne vector I*) i s called the target vector of this span program P. 

Definition 3 (Witness size of a span program). Let P be a span program as in Definition [2] 

• For an input x G a witness for x is an n-tuple of mappings w\, . . . , w n , where Wj : Vj Xj — > 

C, such that \t) = Ylje[n] Yl\v)eV x w j(\ v ))\ v )- The witness size on input x G denoted 
by wsize(P, x), is defined as 
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wsize(P,x)= min |wj(|t>))|' 

(wi ,...,iii n ):witness for x in P 

jein}\v)£V jx . 
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• For an input x G / _1 (0), a witness for x is a vector |«/) G % such that (i|u>') = 1 and (u |«/) = 
for every G UjeM Vjxj- This time, the witness size on input x G / -1 (0), again denoted 
by wsize(P, x), is defined as 

wsize(P, x) = min |(?j|u/)| 2 . 

|ui'):witness for a; in P f-~* 

je[n},be{0,l}\v)eV jb 

• The witness size of this span program is wsize(P) = max ie £> wsize(P, x). 

• Finally, we denote by wsize(/) the minimum witness size of a span program which computes /. 

For a function / : D — > {0, 1}, where D C {0, l} n , we denote by Q(/) the quantum query complexity 
of / with two-sided error probability at most 1/3. As is well known, changing the error probability to 
other constants less than 1/2 affects the query complexity only within a constant factor. 

Theorem 4 (Reichardt HRei09IIReillTn . Let f : D ->■ {0, 1}, where D C {0, l} n . 7%en Q(/) W wsize(/) 
coincide up to a constant factor. That is, there exists a constant c > 1 which does not depend on n or f 
such that (1/c) wsize(/) < Q(f) < c • wsize(/). 

Proof. Ref. [Rei09] showed that wsize(/) is equal to the general adversary bound for /, and Ref. MReilll 
showed that the general adversary bound for / and Q(f) coincide up to a constant factor. □ 

2.2 Quantum algorithm for approximate counting 

To detect the case where marked vertices have too many edges, we will use the following result by Bras- 
sard, H0yer, and Tapp [BHT98, Theorem 5]. 

Theorem 5 (Approximate counting [BHT98]). There exists a quantum algorithm which, given inte- 
gers N > 1 (domain size) and P > 4 (precision) and oracle access to a function F: [N] — > {0, 1} 
satisfying t = \F~ l (l)\ < N/2, makes P queries to the oracle and outputs an integer t, such that 

, 27T r— it 2 Ar 
\t-t < —VtN + —jN 
P P A 

with probability at least 8/ir 2 . 

We can remove the assumption that t < N/ 2 by doubling the size of the domain, and we can reduce 
the error probability to an arbitrarily small constant by repeating the algorithm constantly many times and 
taking the majority vote: 

Corollary 6. Let e > be a constant. Then there exists a quantum algorithm which, given integers N > 1 
(domain size) and P > 4 (precision) and oracle access to a function F: [N] — > {0, 1}, makes 0{P) 
queries to the oracle and outputs an integer t satisfying the following. Let t = \F~ 1 (1)\. Then it holds 
that 

2V2 it r— 2vr 2 



\t-t\<^^VtN +T ^N 

with probability at least 1 — e. (The constant factor hidden in the O-notation of the number of queries 
depends only on e and not on N, P, or F.) 
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3 Algorithm for the main case 

The following lemma is useful in the case where not too many edges are incident to marked vertices. In 
the next section, we will use it with k = 2max{n, a*(G)} to prove TheoremQ] 

Lemma 7. Let G be a graph on n vertices, and let k G N. Consider the special case of the graph collision 
problem on G where it is promised that the set S of marked vertices satisfies deg(S') < k. 

(i) There exists a span program for this promise problem whose witness size is at most \j2{n + k). 

( ii) There exists a quantum algorithm for this promise problem with two-sided error probability at 
most 1/6 whose query complexity is 0{y/n + Vk). 

Proof. Item (ii) follows immediately from item (i) and Theorem |4] In the rest of the proof, we will prove 
item (i) by constructing a span program explicitly. 



For j G [n] and b G {0, 1}, let \s jb ) = J2ze{o,i}": Zj =b\ z )- Let v j0 = an d Vji = {\sjo)} U {\s a ) : i G 
N(j)}, where N(j) is the set of neighbors of vertex j in graph G. Define a span program P as P = 

(n,\ty,v 10 ,v 11 ,...,v n0 ,v nl ). 

It is easy to see that P computes the promise problem stated in the lemma. Indeed, if x G / _1 (1), then 
there exists an edge ij G E such that Xi = Xj = 1. Therefore, |sjo) £ Vj\ and \ sj\) G V%x, which implies 
that \t) = 7|sjo) + l\sji) G span(V 7 i U Vn). On the other hand, if x G / _1 (0), then \w') = |x)/7 is a 
witness for x. Indeed, (t\w'} = 1, {sjo\w') = if Xj = 1, and {sn\w ') = if Xj = 1 and i G N(j). 

From these witnesses, the witness size of P can be bounded easily. If x G then the witness 

stated above shows that the witness size for x is at most 2^ 2 = y/2(n + k). If x G / _1 (0), then the 
witness stated above shows that the witness size for x is at most (n + k)/^ 2 = ^/2{n + k). □ 

4 Preprocessing and overall algorithm 

In this section, we will prove Theorem Q] 

Consider the following quantum algorithm. 

1. Compute a*(G). Let s = max{a*(G), n}. (Because this step does not use any queries to x, 
how a* (G) is computed does not matter as long as the query complexity is concerned.) 

2. (Preprocessing.) Estimate the number t of pairs G [n] 2 such that ij G E and Xi = 1 by 
running the approximate counting algorithm in Corollary |6]with error probability e = 1/6 and 
precision parameter P = max{4, [7-^^/^]}. If the result t of counting satisfies t > 3s/2, then 
answer "yes" and halt. Otherwise, proceed to the next step. 

3. (Main case.) Run the algorithm in Lemma[7J(ii) using error probability 1/6 and parameter k = 2s, 
and answer "yes" or "no" accordingly. 



Let^ = C^°- 1 > n ,and let 




ze{o,i} 
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Query complexity. Step 1 does not make any queries to x. Step 2 makes O(P) = 0(y/n) queries. 
Step 3 makes 0(y/n + v2s) = 0{^+ y / a*(G)) queries. Therefore, the query complexity of the whole 
algorithm is 0(y/n + y/a*(G)). 



Correctness. In the rest of this section, we will show that this algorithm reports an incorrect answer 
with probability at most 1/3 by considering the following three cases: (a) the correct answer is "yes" 
and t < 2s, (b) the correct answer is "yes" and t > 2s, (c) the correct answer is "no." 

If the correct answer is "yes" and t < 2s, the only step where the algorithm reports an incorrect answer 
is step 3, and the promise of the algorithm in Lemma [7] (ii) is satisfied. Therefore, the error probability is 
at most 1/6. 

If the correct answer is "yes" and t > 2s, then with probability at least 5/6, t satisfies that 

t> t-\t-t\ 



2\J~2 7r i-^ 2vr 2 ., 
>t —Vtn 2 --^-n 



2s/2 r- 2 

> t v tn n 

-7 49 

2V2 _\ 2 



> Vt{yi —yfr 



49 n 



r—f r- 2^2 _\ 2 3 

Therefore, the algorithm reports "yes" in step 2 alone with probability at least 5/6. In this case, the 
promise of the algorithm in Lemma |7](ii) is not satisfied, but it does not matter what step 3 reports. 

Finally, consider the case where the correct answer is "no." In this case, both steps 2 and 3 can report 
an incorrect answer, and we will bound each of these probability by 1/6. The correct answer being "no" 
means that the set of marked vertices is an independent set of G, and therefore it holds that t < a*(G) < s 
by the definitions of a*(G) and s. This implies that with probability at least 5/6, t satisfies that 



t<t+\t-t\ 



2^/2-K r^: 2vr 



< t H vtn 2 + 



2 



n 



2 



P P 2 

2^2 r- 2 

<t-\ =—vtn + — n 

7 49 

2^/2 . 2 3 

<s + —V^s + -s<-s. 

Therefore, step 2 reports in an incorrect answer with probability at most 1/6. Moreover, because the 
promise of the algorithm in Lemma |7](ii) is satisfied, step 3 reports an incorrect answer with probability 
at most 1/6. By union bound, the overall error probability is at most 1/6 + 1/6 = 1/3. 



5 The case of random graphs 

In this section we analyze the query complexity of the graph collision problem defined over random 
graphs. Recall that we denote by the distribution of random graphs on n vertices, where every edge 
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is present with probability p, independently of other edges. 
We need the following combinatorial lemma. 

Lemma 8. For arbitrary p G (0, 1] and t > 40n log n, 



-t 2 



Pr [a* (GO > t] < n~ i4n + 2 exp , 

P roo/ Assume G ~ /i njP . For any i G N, let be the expected number of independent sets S C [n] that 
satisfy deg(S) > t. Clearly, 

Pr[a*(G) >t] <E[Y t ], 

and therefore we want an upper bound on E [Y t ]. 
For any i > 2 it holds that 

n\ (i) ( pi 2 
I ■ (1 — p) w < exp I ilogn — 

Let xo G [n] , to be fixed later. Then 

E [Y] < £ ( n ) • (1 - Pp ■ Pr [deg(,S) > t : |S| = *] 
i=l 

+ £ (:)-a-p)fi) 

i=x +l V 7 

< exp(a log n + 3mp — t) + exp (i log re — 

i=l i=xo+l 

where the last inequality follows from Lemma [3] 

Fix xq = min{ |_ J , n}. Then, noting i > 40n log n, it holds that 



exp ^ log n - ^-^ < 2 exp ^-^^ 



i=a: +l 



and we continue: 



E [Yt] < exp(log xq + xo log re + 3nxop — t) + 2 exp 
< exp + 2 exp 



PXp\ 



20 / V ^00n 2 p / 

The result follows. □ 
The theorem below and Corollary |2]follow immediately from Theorem Q] and Lemma[8] 

Theorem 9. There exists a universal constant C such that for any p G (0, 1], n G N arcc? £ > 40nlogn 
the following holds. For G ~ /J- n ,p, the probability that the (worst-case) quantum query complexity of the 
graph collision problem on G is greater than C(i/w + Vt) is at most re~ 14n + 2 exp( 2Q ~^ ). 
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6 Concluding remarks 



We gave a quantum algorithm for the graph collision problem on graph Gonn vertices whose query 
complexity is bounded as 0(y/n + yV(G)) in terms of the maximum sum of degrees of the vertices 
in an independent set of G. We used this to show that the graph collision problem has quantum query 
complexity 0(y/n) for almost all graphs if a graph is chosen at random so that each edge is present with 
a fixed probability independently of other edges. 

We conclude by stating a few open problems. Clearly improving the algorithm so that its query 
complexity becomes 0(y/n) for all graphs is an important open problem. As another direction, the graph 
collision problem can be defined also for hypergraphs, and it is used in an algorithm for the subgraph 
finding problem [MSS07], a natural generalization of the triangle finding problem. Extending the present 
algorithm to the case of hypergraphs is another open problem. 

Acknowledgments 

Dmitry Gavinsky is grateful to Ryan O'Donnell, Rocco Servedio, Srikanth Srinivasan and Li- Yang Tan for 
helpful discussions. The authors acknowledge support by ARO/NSA under grant W91 1NF-09- 1-0569. 

References 

[Amb07] Andris Ambainis. Quantum walk algorithm for element distinctness. SIAM Journal on Com- 
puting, 37(l):210-239, 2007. arXiv:quant-ph/0311001v8. 

[AS04] Scott Aaronson and Yaoyun Shi. Quantum lower bounds for the collision and the element dis- 
tinctness problems. Journal of the ACM, 51(4):595-605, 2004. arXiv:quant-ph/0112086vl. 

[BBBV97] Charles H. Bennett, Ethan Bernstein, Gilles Brassard, and Umesh Vazirani. Strengths and 
weaknesses of quantum computing. SIAM Journal on Computing, 26(5): 1510-1523, 1997. 
arXiv :quant-ph/9701 00 1 v 1 . 

[BBC + 01] Robert Beals, Harry Buhrman, Richard Cleve, Michele Mosca, and Ronald de Wolf. Quantum 
lower bounds by polynomials. Journal of the ACM, 48(4):778-797 ', 2001. arXiv:arXiv:quant- 
ph/9802049v3. 

[BDH + 05] Harry Buhrman, Christoph Diirr, Mark Heiligman, Peter H0yer, Frederic Magniez, Miklos 
Santha, and Ronald de Wolf. Quantum algorithms for element distinctness. SIAM Journal on 
Computing, 34(6): 1324-1330, 2005. arXiv:quant-ph/0007016v2. 

[BHT98] Gilles Brassard, Peter H0yer, and Alain Tapp. Quantum counting. In Automata, Languages 
and Programming: 25th International Colloquium, ICALP'98: Proceedings, volume 1443 of 
Lecture Notes in Computer Science, pages 820-831, 1998. arXiv:quant-ph/9805082vl. 

[BL1 1] Aleksandrs Belovs and Troy Lee. Quantum algorithm for fc-distinctness with prior knowledge 
on the input. arXiv:1108.3022vl [quant-ph], 2011. 

[DM05] Evgeny Drukh and Yishay Mansour. Concentration bounds for unigram language models. 
Journal of Machine Learning Research, 6(Aug): 123 1-1264, 2005. 



8 



[Gro96] Lov K. Grover. A fast quantum mechanical algorithm for database search. In Proceedings 
of the 28th Annual ACM Symposium on Theory of Computing (STOC), pages 212-219, 1996. 
arXiv:quant-ph/9605043v3. 

[JKM12] Stacey Jeffery, Robin Kothari, and Frederic Magniez. Improving quantum query complexity 
of Boolean matrix multiplication using graph collision. arXiv:1112.5855v2 [quant-ph], 2012. 

[KW93] Mauricio Karchmer and Avi Wigderson. On span programs. In Proceedings of the 8th Annual 
Structure in Complexity Theory Conference (CoCo), pages 102-111, 1993. 

[MSS07] Frederic Magniez, Miklos Santha, and Mario Szegedy. Quantum algorithms for the triangle 
problem. SI AM Journal on Computing, 37(2) :4 13^24, 2007. arXiv:quant-ph/0310134v3. 

[Rei09] Ben W. Reichardt. Span programs and quantum query complexity: The general adversary 
bound is nearly tight for every boolean function. In 50th Annual IEEE Symposium on Foun- 
dations of Computer Science (FOCS), pages 544-551, 2009. arXiv:0904.2759vl [quant-ph]. 

[Reill] Ben W. Reichardt. Reflections for quantum query algorithms. In Proceedings of the 
22nd Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 560-569, 2011. 
arXiv:1005.1601vl [quant-ph]. 

[RS08] Ben W. Reichardt and Robert Spalek. Span-program-based quantum algorithm for evaluating 
formulas. In Proceedings of the 40th Annual ACM Symposium on Theory of Computing 
(STOC), pages 103-112, 2008. arXiv:0710.2630v3 [quant-ph]. 



9 



